Use log-scaled quantile sketch budgets and rank-based accuracy checks by RAMitchell · Pull Request #12129 · dmlc/xgboost

RAMitchell · 2026-03-25T16:08:16Z

Summary

This PR aligns quantile sketch sizing more closely with the single-machine algorithm and updates the test suite to validate rank-error guarantees instead of cut-value deltas.

The main functional change is on the CPU distributed sketch path: we now track the number of represented elements per feature, serialize those counts through the distributed sketch payload, and recompute SketchSummaryBudget(...) after merge/prune using the summed per-feature counts. This changes the distributed CPU merge budget from a fixed O(1 / eps) cap to the same O(log n / eps) budget shape used by the underlying sketch.
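To illustrate the difference in budget shape, here is a minimal Python sketch comparing a fixed O(1 / eps) cap with a log-scaled O(log n / eps) budget. The function names and the exact formula are assumptions made for illustration; this is not the actual `SketchSummaryBudget(...)` implementation, whose constants and clamping may differ.

```python
import math


def fixed_budget(eps: float) -> int:
    """Fixed cap of O(1 / eps): independent of how many elements were seen."""
    return math.ceil(1.0 / eps)


def log_scaled_budget(n: int, eps: float) -> int:
    """Hypothetical O(log n / eps) budget (illustrative formula only).

    The budget never exceeds n, since a summary cannot usefully hold more
    entries than there are elements.
    """
    if n <= 0:
        return 0
    levels = max(1.0, math.log2(n))
    return min(n, math.ceil(levels / eps))


if __name__ == "__main__":
    for n in (4, 1024, 1 << 20):
        print(n, fixed_budget(0.125), log_scaled_budget(n, 0.125))
```

Under this assumed formula the budget grows with the number of represented elements, which is why the merge step needs the per-feature counts rather than only `eps`.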

In addition, this PR cleans up related sizing paths and strengthens quantile accuracy coverage across C++ and Python.

What This Changes

track represented element counts in WQuantileSketch
serialize per-feature element counts in the CPU distributed sketch payload
recompute the CPU distributed merge/prune budget from summed per-feature counts using SketchSummaryBudget(...)
use the same summary-budget helper in the CPU sorted-column ingestion path
preserve exact weighted values in the sorted sketch when the budget can retain every unique value
deduplicate sketch-budget logic by routing related CPU/GPU helper paths through the shared budget helpers
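
The count plumbing in the list above can be sketched roughly as follows: each worker reports one element count per feature, the counts are summed across workers, and the merge/prune budget is recomputed per feature from the totals. This is a plain-Python illustration under stated assumptions; `sum_feature_counts`, `merge_budgets`, and the budget formula are hypothetical and do not mirror the C++ code or its allreduce payload.

```python
import math
from typing import List


def sum_feature_counts(per_worker_counts: List[List[int]]) -> List[int]:
    """Element-wise sum of per-feature counts over all workers."""
    n_features = len(per_worker_counts[0])
    totals = [0] * n_features
    for counts in per_worker_counts:
        if len(counts) != n_features:
            raise ValueError("every worker must report one count per feature")
        for feature, count in enumerate(counts):
            totals[feature] += count
    return totals


def merge_budgets(per_worker_counts: List[List[int]], eps: float) -> List[int]:
    """Per-feature budget from summed counts (assumed O(log n / eps) formula)."""
    budgets = []
    for n in sum_feature_counts(per_worker_counts):
        if n <= 0:
            budgets.append(0)
            continue
        levels = max(1.0, math.log2(n))
        budgets.append(min(n, math.ceil(levels / eps)))
    return budgets


if __name__ == "__main__":
    # Two workers, three features; sparse features have small or zero counts.
    print(merge_budgets([[1024, 0, 16], [1024, 4, 0]], eps=0.125))
```

Because sparse features can have very different counts on different workers, the budget is computed per feature from the global totals rather than from any single worker's view.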

Test Changes

replace CPU distributed cut-to-cut comparisons with rank-error validation
add sparse row-split distributed tests where per-feature counts vary across both features and workers
add deterministic sorted weighted exact-cut coverage
align local GPU quantile tests with the same rank-based validation contract and shared weighted tolerance
add shared Python rank-error validation helpers and use them in QuantileDMatrix / quantile-cut tests
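
As a rough illustration of what rank-error validation means compared with cut-to-cut comparison, the sketch below measures how far each cut's achieved (weighted) rank is from its ideal quantile rank, in units of one bin's weight. The helper name, the treatment of cuts as interior split points, and the normalization are assumptions for illustration; the shared helpers added in this PR may define the contract differently.

```python
from bisect import bisect_right
from itertools import accumulate
from typing import Optional, Sequence


def max_normalized_rank_error(
    values: Sequence[float],
    cuts: Sequence[float],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Largest |achieved rank - target rank| over all cuts, divided by the
    weight of one ideal bin (total_weight / n_bins)."""
    if weights is None:
        weights = [1.0] * len(values)
    order = sorted(range(len(values)), key=values.__getitem__)
    sorted_vals = [values[i] for i in order]
    cum_w = list(accumulate(weights[i] for i in order))
    total = cum_w[-1]
    n_bins = len(cuts) + 1  # interior cuts split the data into n_bins bins
    worst = 0.0
    for k, cut in enumerate(cuts, start=1):
        idx = bisect_right(sorted_vals, cut)  # number of values <= cut
        rank = cum_w[idx - 1] if idx > 0 else 0.0
        target = total * k / n_bins
        worst = max(worst, abs(rank - target) / (total / n_bins))
    return worst


if __name__ == "__main__":
    data = list(range(1, 101))
    print(max_normalized_rank_error(data, [25, 50, 75]))
    print(max_normalized_rank_error(data, [30, 50, 75]))
```

A test built on this kind of helper asserts that the error stays below a threshold, so two sketches that pick different but equally valid cut values both pass.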

Testing

Ran locally:

./build-cpu/testxgboost --gtest_filter='Quantile.*:HistUtil.*'
./build-cuda-local/testxgboost --gtest_filter='HistUtil.*:GPUQuantile.*'
pytest tests/python/test_data_iterator.py tests/python/test_quantile_dmatrix.py tests/python/test_updaters.py -k "test_data_iterator or test_training or test_ref_quantile_cut or test_get_quantile_cut"

Notes

This PR is no longer limited to the CPU distributed merge/prune path. It now includes:

the CPU distributed log n / eps budget plumbing
the sorted weighted exact-summary fix
shared rank-based validation updates across C++, GPU coverage, and Python

Copilot

Pull request overview

This PR updates quantile sketch budgeting to follow the same O(log n / eps) summary-size behavior as the single-machine sketch (including distributed CPU merge/prune), and refreshes test coverage to validate the rank-error contract instead of comparing cut values directly.

Changes:

Track per-feature represented element counts in WQuantileSketch, serialize them in the distributed CPU sketch allreduce payload, and recompute merge/prune budgets from those counts.
Route multiple CPU/GPU sketch sizing paths through shared budget helpers (SketchSummaryBudget), including the GPU intermediate prune target.
Replace/extend C++ and Python tests to use rank-based cut validation (plus exact-cut coverage when the budget can retain all unique values).
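
The exact-cut case mentioned in the last item can be pictured with a small sketch: for sorted weighted input, duplicates are collapsed into one entry per unique value with summed weight, and if the number of unique values fits within the budget, nothing needs to be pruned. The function below is a hypothetical Python illustration of that idea and is not the fast path implemented in `src/common/quantile.h`.

```python
from itertools import groupby
from typing import List, Optional, Sequence, Tuple


def exact_summary(
    sorted_values: Sequence[float],
    weights: Sequence[float],
    budget: int,
) -> Optional[List[Tuple[float, float]]]:
    """Return one (value, total_weight) entry per unique value when they all
    fit in the budget; return None when pruning would be required."""
    pairs = zip(sorted_values, weights)
    summary = [
        (value, sum(w for _, w in group))
        for value, group in groupby(pairs, key=lambda p: p[0])
    ]
    return summary if len(summary) <= budget else None


if __name__ == "__main__":
    values = [1.0, 1.0, 2.0, 3.0, 3.0]
    weights = [0.5, 0.5, 2.0, 1.0, 3.0]
    print(exact_summary(values, weights, budget=3))
    print(exact_summary(values, weights, budget=2))
```

When the summary is exact, a test can compare cut values directly, which is why the exact-cut coverage sits alongside the rank-based checks.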

Reviewed changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| `src/common/quantile.h` | Adds element-count tracking to `WQuantileSketch` and an exact-summary fast path for sorted weighted input. |
| `src/common/quantile.cc` | Extends distributed sketch payload to include element counts and uses `SketchSummaryBudget` during merge/prune and sorted ingestion. |
| `src/common/quantile.cu` | Uses `SketchSummaryBudget` for GPU intermediate pruning instead of a local helper. |
| `src/common/quantile.cuh` | Removes `IntermediateNumCuts()` helper (now replaced by shared budget helper usage). |
| `src/common/hist_util.cu` | Switches sample-cut sizing to `SketchSummaryBudget`. |
| `tests/cpp/common/test_hist_util.h` | Tightens/aligns rank-error thresholds, updates exact-value validation, and adds a weight-aware validation wrapper. |
| `tests/cpp/common/test_hist_util.cu` | Uses the new weight-aware validation wrapper for GPU sketch tests. |
| `tests/cpp/common/test_hist_util.cc` | Adjusts rank-error validation for weighted CPU cases and adds a sorted weighted exact-cut regression test. |
| `tests/cpp/common/test_quantile.cc` | Reworks distributed CPU quantile tests to validate rank error (row/column split + sparse count skew). |
| `tests/cpp/common/test_quantile.cu` | Aligns distributed GPU weighted tolerance usage with the shared weighted threshold. |
| `python-package/xgboost/testing/quantile_dmatrix.py` | Adds shared Python rank-error validation helpers and uses them in reference-cut checks. |
| `python-package/xgboost/testing/updater.py` | Adds rank-error assertions for `get_quantile_cut` device tests (numerical case). |
| `tests/python/test_data_iterator.py` | Replaces local rank-error helper with shared Python helper. |
| `tests/python/test_quantile_dmatrix.py` | Adds rank-error assertions for iterator-vs-array quantile cuts in training test. |

trivialfis · 2026-04-02T09:42:10Z

python-package/xgboost/testing/quantile_dmatrix.py

+MAX_NORMALIZED_RANK_ERROR = 2.0
+MAX_WEIGHTED_NORMALIZED_RANK_ERROR = 14.0


Could you please provide some brief comments on utilities here?

RAMitchell · Apr 7, 2026

I will do a bit more rewriting. Also, the weighted rank-error bound of 14 is much larger than it should be.

RAMitchell added 8 commits March 25, 2026 09:04

Use O(log n/eps) budget for CPU distributed sketch

96be74a

Test CPU quantile accuracy with rank-error bounds

7225a2b

Scope rank-error test bounds to CPU coverage

6229cef

Merge remote-tracking branch 'upstream/master' into cpu-distributed-quantile-logn-budget

0a4ffaf

Conflicts: tests/cpp/common/test_hist_util.cu

Merge remote-tracking branch 'upstream/master' into cpu-distributed-quantile-logn-budget

12e7137

Conflicts: tests/cpp/common/test_hist_util.cc, tests/cpp/common/test_hist_util.cu, tests/cpp/common/test_hist_util.h

Unify quantile rank-error checks and budgets

3ab77c8

Preserve exact weighted values in sorted sketch

d55e902

Add weighted and sparse quantile coverage

9ac6639

RAMitchell changed the title ~~[WIP] Increase CPU distributed quantile sketch budget to O(log n / eps)~~ Use log-scaled quantile sketch budgets and rank-based accuracy checks Apr 1, 2026

RAMitchell marked this pull request as ready for review April 1, 2026 11:39

RAMitchell requested a review from Copilot April 1, 2026 11:39

Copilot started reviewing on behalf of RAMitchell April 1, 2026 11:40

Copilot AI reviewed Apr 1, 2026

tests/cpp/common/test_hist_util.h (outdated, resolved)

src/common/quantile.h (outdated, resolved)

tests/cpp/common/test_quantile.cc (outdated, resolved)

tests/cpp/common/test_quantile.cc (outdated, resolved)

RAMitchell added 2 commits April 1, 2026 06:30

Address quantile review follow-ups

25488e3

Fix weighted MGPU quantile reference test

9dcaf91

RAMitchell requested a review from trivialfis April 1, 2026 14:11

trivialfis reviewed Apr 2, 2026

RAMitchell wants to merge 10 commits into dmlc:master from RAMitchell:cpu-distributed-quantile-logn-budget


3 participants
